Emscripten build (demo, quick and dirty) #12

ggerganov · 2023-07-23T20:26:35Z

This is a very ugly hack to demonstrate build with Emscripten

Not intended to merge as it will just ruin the beauty of the implementation and most likely there is a much better way to do this. Mainly for educational purposes.

emcc -O3 run.c \
  -o web/llama2.js \
  -s EXPORTED_FUNCTIONS='["_main", "_main_loop", "_malloc", "_free"]' \
  -s EXPORTED_RUNTIME_METHODS='["ccall"]' \
  -s ALLOW_MEMORY_GROWTH=1 \
  --preload-file model.bin \
  --preload-file vocab.bin

The vocab.bin is generated in a similar way explained in #9

The produced artifacts will be generated in the web subfolder.

Example: https://ggerganov.com/llama2.c

alexeykudinkin

Worth merging IMO even if not for Emscripten, then for cleaning up the main sequence at the very least

alexeykudinkin · 2023-07-23T21:29:16Z

run.c

+        checkpoint = "model.bin";
+        FILE *file = fopen(checkpoint, "rb");
+        if (!file) {
+            printf("Unable to open file!");


Suggested change

printf("Unable to open file!");

printf("Unable to open file!");

return 1;

alexeykudinkin · 2023-07-23T21:29:31Z

run.c

+    {
+        FILE *file = fopen("vocab.bin", "r");
+        if (!file) {
+            printf("Unable to open file!");


Suggested change

printf("Unable to open file!");

printf("Unable to open file!");

return 1;

karpathy · 2023-07-23T22:38:45Z

Very cool ty! I'm not as familiar with emscripten, I'll take some time to look at this and whether there is a way to support it gracefully.

karpathy · 2023-07-24T01:34:19Z

@ggerganov one thing I really like is the tokenizer inside C here, removing the need to use the run_wrap.py and sentencepiece. But it's not clear how you produced vocab.bin, presumably it's an export script?

karpathy · 2023-07-24T04:03:45Z

@ggerganov one thing I really like is the tokenizer inside C here, removing the need to use the run_wrap.py and sentencepiece. But it's not clear how you produced vocab.bin, presumably it's an export script?

nvm got it working in 3bfa566

ggerganov · 2023-07-24T04:30:55Z

Yes, for this PR I had exported the vocab from llama.cpp, which has been generated with a similar script as yours.

I'm not sure about the cause of the leading space - we have it in llama.cpp output as well and haven't dug to understand how to fix it yet.

kroggen · 2023-07-26T00:58:28Z

The tokenizer stores a leading space on some tokens.

In python we use the Decode function, that removes the leading space on the first token

But this repo is using the id_to_piece() function to export the tokenizer

python3
>>> import sentencepiece as spm
>>> sp = spm.SentencePieceProcessor(model_file='tokenizer.model')
>>> sp.id_to_piece([9038])
['▁Once']

Then it is simply printed, without considering if it is the first token:

llama2.c/run.c

Line 471 in f565089

printf("%s", vocab[next]);

The PR #89 fixes it

gohai · 2023-07-28T05:57:00Z

I recreated @ggerganov's changes against the current tree (with support for prompt): https://github.com/gohai/llama2.c/commits/emscripten

rahuldshetty · 2023-08-16T07:55:35Z

I've built a JS framework around this idea to run Language Models on Web.
This also leverages Emscripten to compile llama2.c into WASM JS.

llama2.c on Web Demo: https://rahuldshetty.github.io/ggml.js-examples/llama2_tinystories.html
ggml.js Framework: https://github.com/rahuldshetty/ggml.js

emcc : quick n dirty hack to build with Emscripten

a93022c

alexeykudinkin reviewed Jul 23, 2023

View reviewed changes

python273 mentioned this pull request Jul 23, 2023

Tokenizer in C #15

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Emscripten build (demo, quick and dirty) #12

Emscripten build (demo, quick and dirty) #12

ggerganov commented Jul 23, 2023

alexeykudinkin left a comment

alexeykudinkin Jul 23, 2023

alexeykudinkin Jul 23, 2023

karpathy commented Jul 23, 2023

karpathy commented Jul 24, 2023

karpathy commented Jul 24, 2023

ggerganov commented Jul 24, 2023

kroggen commented Jul 26, 2023 •

edited

Loading

gohai commented Jul 28, 2023

rahuldshetty commented Aug 16, 2023

	printf("Unable to open file!");
	printf("Unable to open file!");
	return 1;

Emscripten build (demo, quick and dirty) #12

Are you sure you want to change the base?

Emscripten build (demo, quick and dirty) #12

Conversation

ggerganov commented Jul 23, 2023

alexeykudinkin left a comment

Choose a reason for hiding this comment

alexeykudinkin Jul 23, 2023

Choose a reason for hiding this comment

alexeykudinkin Jul 23, 2023

Choose a reason for hiding this comment

karpathy commented Jul 23, 2023

karpathy commented Jul 24, 2023

karpathy commented Jul 24, 2023

ggerganov commented Jul 24, 2023

kroggen commented Jul 26, 2023 • edited Loading

gohai commented Jul 28, 2023

rahuldshetty commented Aug 16, 2023

kroggen commented Jul 26, 2023 •

edited

Loading